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<110> APPLICANT: 


Wilkins, Thea A. 
















4 




The Regents of 


: the University of California 








D 


<120> TITLE 


OF INVENTION: Cotton Transcription Factors i 


and Thei; 


Q 
O 


<130> FILE REFERENCE 


:: 02307O-095600US 














1 n 


<140> CURRENT APPLICATION NUMBER: US 09/453,387 










1 X 


<141> CURRENT FILING 


; DATE: : 


L999- 


-12-02 














1 T 
±J 


<160> NUMBER OF 


SEQ 


ID ] 


SfOS: 


26 


















1 

1 0 


<170> SOFTWARE: 


Patentin Ver. 2. 


1 
















1 1 
1 / 


<210> SEQ ID NO: 


1 
























io 


<211> LENGTH: 1006 
























1 Q 


<212> TYPE: 


DNA 


























o rt 
^lU 


<213> ORGANISM: 


Gossypium hirsutum 


















<220> FEATURE: 




























<221> NAME/KEY: 


CDS 


























<222> LOCATION: 


(59) 


. . (943) 






















<223> OTHER 


INFORMATION 


: GhMYBl 


















97 


<400> SEQUENCE: 


1 
























OQ 

£. O 


taacaccgtt attctttctc tattctacct 


: gatttgattt 


gatttgattt i 


bgtaactg 


JU 


atg 


gga 


cga 


tea 


cet 


tgt 


tgt 


gaa 


aag 


get 


cat 


ace 


aac 


aaa 


ggt 


gee 


o X 


Met 


Gly 


Arg 


Ser 


Pro 


Cys 


Cys 


Glu 


Lys 


Ala 


His 


Thr 


Asn 


Lys 


Gly 


Ala 




1 








5 










10 










15 




T /I 


tgg 


acc 


aaa 


gag 


gaa 


gat 


caa 


cgc 


etc 


ate 


aac 


tac 


ate 


egt 


gtc 


eat 




Trp 


Thr 


Lys 


Glu 


Glu 


Asp 


Gin 


Arg 


Leu 


He 


Asn 


Tyr 


He 


Arg 


Val 


His 










20 










25 










30 






Q 
JO 


ggt 


gaa 


ggc 


tge 


tgg 


egt 


tec 


etc 


cec 


aaa 


get 


get 


ggg 


etg 


ctt 


aga 




Gly 


Glu 


Gly 


Cys 


Trp 


Arg 


Ser 


Leu 


Pro 


Lys 


Ala 


Ala 


Gly 


Leu 


Leu 


Arg 


/I n 
4 U 






35 










40 










45 








A O 


tgt 


ggt 


aag 


agt 


tge 


aga 


tta 


aga 


tgg 


ata 


aac 


tac 


ttg 


agg 


cet 


gat 


4 J 


Cys 


Gly 


Lys 


Ser 


Cys 


Arg 


Leu 


Arg 


Trp 


He 


Asn 


Tyr 


Leu 


Arg 


Pro 


Asp 


44 




50 










55 










60 










4 D 


ctt 


aag 


aga 


gga 


aat 


ttc 


act 


gaa 


gaa 


gaa 


gat 


gag 


ctt 


ate 


ate 


aag 


4 / 


Leu 


Lys 


Arg 


Gly 


Asn 


Phe 


Thr 


Glu 


Glu 


Glu 


Asp 


Glu 


Leu 


He 


He 


Lys 


/I Q 
4 o 


65 










70 










75 










80 


50 


ctt 


cac 


agt 


tta 


ctt 


gga 


aac 


aaa 


tgg 


tea 


ttg 


att 


get 


gga 


aga 


tta 


51 


Leu 


His 


Ser 


Leu 


Leu 


Gly 


Asn 


Lys 


Trp 


Ser 


Leu 


He 


Ala 


Gly 


Arg 


Leu 


52 










85 










90 










95 




54 


cca 


gga 


aga 


aca 


gat 


aat 


gag 


ata 


aag 


aac 


tac 


tgg 


aac 


aca 


cac 


ate 


55 


Pro 


Gly 


Arg 


Thr 


Asp 


Asn 


Glu 


He 


Lys 


Asn 


Tyr 


Trp 


Asn 


Thr 


His 


He 


56 








100 










105 










110 






58 


aaa 


aga 


aag 


ctt 


ata 


age 


aga 


gga 


att 


gat 


eca 


caa 


act 


cat 


egt 


cet 


59 


Lys 


Arg 


Lys 


Leu 


He 


Ser 


Arg 


Gly 


He 


Asp 


Pro 


Gin 


Thr 


His 


Arg 


Pro 


60 






115 










120 










125 








62 


etc 


aat 


caa 


acg 


gee 


aat 


ace 


aac 


aca 


gtc 


aca 


gee 


cec 


ace 


gaa 


ttg 


63 


Leu 


Asn 


Gin 


Thr 


Ala 


Asn 


Thr 


Asn 


Thr 


Val 


Thr 


Ala 


Pro 


Thr 


Glu 


Leu 


64 




130 










135 










140 










66 


gat 


ttc 


aga 


aac 


teg 


cec 


aca 


tec 


gtt 


tec 


aaa 


tec 


agt 


tec 


ate 


aaa 


67 


Asp 


Phe 


Arg 


Asn 


Ser 


Pro 


Thr 


Ser 


Val 


Ser 


Lys 


Ser 


Ser 


Ser 


He 


Lys 




58 
106 



154 



202 



250 



298 



346 



394 



442 



490 



538 
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DO 












1 










LOO 










X dU 




70 

/ \j 


a a 




L L> L. 




y d L. 




dd L, 


L d 0 


dd u 


i^a a 
y dd 


U L L. 


caa 




aag 


tec 


aac 


^ p ^ 

0 0 D 




Asn 


Pro 


Ser 


Leu 


Asp 


rne 


Asn 


Tyr 


Asn 


Pill 
bXU 


rne 


P 1 r» 

bxn 


rne 


Lys 


Ser 


n « *-i 

Asn 














J. DO 










170 
X / U 










1 / D 






1 A 




^ d L. 


i- r> 




y dd 


y dd 




a a 
d d 0 


f- nf- 


dL'd 


gee 


age 


a rr+- 

dg t 


ggc 


a L.y 


a/-»4- 
dCL. 


Do ft 




inr 


Asp 


Q V 

ber 


Leu 


P T n 


Pin 


Pro 


Asn 


^ys 


1 nr 


^ixa 


Ser 


Ser 


P 1 tr 

bxy 


Met 


inr 












1 0 u 










1 R ^ 










X y u 








7ft 




d L. 


y d d 


ydy 


^d d 


oa a 
v^dd 


y dd 


a rr 
Ody 


0 L y 


Od 0 


a a 
ddy 


a a 
ddy 


eag 


caa 


•1- a 
1- dC 


ggt 


^ft 9 
D 0 Z 


1 Q 


i ilJ. 


Hsp 


o±U 


Pin 


Pin 

oxn 


Pin 

0 j.n 


PI n 

olU 


P 1 r\ 

vjXn 


Leu 


nXS 


Lys 


Lys 


Pin 
bxn 


Pin 

tjxn 


Tyr 


P 1 T r 

bxy 




ft n 

o u 






1 

± 17 J 










900 










90*^ 










ft 9 




dy 0 


dd u 


rtrrrt 

gyg 


Odd 


y d L. 


d L d 


dd L. 


rtg 


gag 


ct g 


teg 


ah +- 
d L U 




a u L 


gtt 


7 0 

/ 0 U 


ft ? 






Zi c 




p;i n 




T 1 0 
X J- tr 


Zi er^ 


Leu 


P 1 n 
uXU 


Leu 


oer 


T 1 0 

xxe 


PI \T 

oxy 


T 1 0 

xxe 


\Za 1 

vax 




ft4 




9 1 n 

Z X u 










91 ^ 
ID 










990 












ft 

O D 


L Ud 


get 


gac 


L L-d 




egg 


y Ld 


uCd 


a a+- 
dd U 


gee 


aae 


teg 


gee 


gag 


teg 


aaa 


T7 Q 


ft 7 
o / 




-rt.±d 










\7a 1 
V d X 


C 0 V 


Z\ 0 n 


ZXl a 


/\sn 


oer 


zx 1 a 
Hxa 


PI n 
oXU 


oer 


Lys 




88 


225 




















0 T 
Z J D 










z 4 U 




90 


cca 


aag 


gta 


gat 


aac 


aac 


aat 


tte 


eag 


ttt 


Ctt 


gaa 


eaa 


get 


atg 


gtg 


ft 9 ^ 
0 Z D 


91 


Pro 


Lys 


Val 


Asp 


Asn 


Asn 


Asn 


Phe 


Gin 


Phe 


Leu 


Glu 


Gin 


Ala 


Met 


Val 




92 










245 










250 










255 






94 


get 


aag 


gcg 


gta 


tgt 


ttg 


tgt 


tgg 


eaa 


tta 


ggt 


ttt 


gga 


aca 


agt 


gaa 


874 


95 


Ala 


Lys 


Ala 


Val 


Cys 


Leu 


Cys 


Trp 


Gin 


Leu 


Gly 


Phe 


Gly 


Thr 


Ser 


Glu 




96 








260 










265 










270 








98 


att 


tgt 


agg 


aac 


tgt 


caa 


aat 


tea 


aat 


tea 


aat 


ggc 


tte 


tat 


agt 


tat 


922 


99 


He 


Cys 


Arg 


Asn 


Cys 


Gin 


Asn 


Ser 


Asn 


Ser 


Asn 


Gly 


Phe 


Tyr 


Ser 


Tyr 




100 




275 








280 








285 








102 


: tgt 


: aga ccc ttg gat 


: tea tag ggteatcttt 


ttcttcttte tttetgtttt 


973 


103 


1 Cys 


i Arg Pro Leu Asp 


) Ser 























W~> 104 290 295 

106 taggagataa attaatgett aattattaaa aaa 1006 

109 <210> SEQ ID NO: 2 

110 <211> LENGTH: 294 

111 <212> TYPE: PRT 

112 <213> ORGANISM: Gossypium hirsutum 
114 <400> SEQUENCE: 2 



115 


Met 


Gly 


Arg 


Ser 


Pro 


Cys 


Cys 


Glu 


Lys 


Ala 


His 


Thr 


Asn 


Lys 


Gly Ala 


116 


1 








5 










10 










15 


117 


Trp 


Thr 


Lys 


Glu 


Glu 


Asp 


Gin 


Arg 


Leu 


He 


Asn 


Tyr 


He 


Arg 


Val His 


118 








20 










25 










30 




119 


Gly 


Glu 


Gly 


Cys 


Trp 


Arg 


Ser 


Leu 


Pro 


Lys 


Ala 


Ala 


Gly Leu 


Leu Arg 


120 






35 










40 










45 






121 


Cys 


Gly 


Lys 


Ser 


Cys 


Arg 


Leu 


Arg 


Trp 


He 


Asn 


Tyr 


Leu 


Arg 


Pro Asp 


122 




50 










55 










60 








123 


Leu 


Lys 


Arg 


Gly 


Asn 


Phe 


Thr 


Glu 


Glu 


Glu Asp 


Glu 


Leu 


He 


He Lys 


124 


65 










70 










75 








80 


125 


Leu 


His 


Ser 


Leu 


Leu 


Gly 


Asn 


Lys 


Trp 


Ser 


Leu 


He 


Ala 


Gly Arg Leu 


126 










85 










90 










95 


127 


Pro 


Gly 


Arg 


Thr 


Asp 


Asn 


Glu 


He 


Lys 


Asn 


Tyr 


Trp Asn 


Thr 


His He 


128 








100 










105 










110 




129 


Lys 


Arg 


Lys 


Leu 


He 


Ser 


Arg 


Gly 


He 


Asp 


Pro 


Gin 


Thr 


His 


Arg Pro 
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130 115 120 125 

131 Leu Asn Gin Thr Ala Asn Thr Asn Thr Val Thr Ala Pro Thr Glu Leu 

132 130 135 140 

133 Asp Phe Arg Asn Ser Pro Thr Ser Val Ser Lys Ser Ser Ser lie Lys 

134 145 150 155 ' 160 

135 Asn Pro Ser Leu Asp Phe Asn Tyr Asn Glu Phe Gin Phe Lys Ser Asn 

136 165 170 175 

137 Thr Asp Ser Leu Glu Glu Pro Asn Cys Thr Ala Ser Ser Gly Met Thr 

138 180 185 190 

139 Thr Asp Glu Glu Gin Gin Glu Gin Leu His Lys Lys Gin Gin Tyr Gly 

140 195 200 205 

141 Pro Ser Asn Gly Gin Asp He Asn Leu Glu Leu Ser He Gly He Val 

142 210 215 220 

14 3 Ser Ala Asp Ser Ser Arg Val Ser Asn Ala Asn Ser Ala Glu Ser Lys 

144 225 230 235 240 

14 5 Pro Lys Val Asp Asn Asn Asn Phe Gin Phe Leu Glu Gin Ala Met Val 

146 245 250 255 

147 Ala Lys Ala Val Cys Leu Cys Trp Gin Leu Gly Phe Gly Thr Ser Glu 

148 260 265 270 

14 9 He Cys Arg Asn Cys Gin Asn Ser Asn Ser Asn Gly Phe Tyr Ser Tyr 

150 275 280 285 

151 Cys Arg Pro Leu Asp Ser 

152 290 

156 <210> SEQ ID NO: 3 

157 <211> LENGTH: 1151 

158 <212> TYPE: DNA 

159 <213> ORGANISM: Gossypium hirsutum 

161 <220> FEATURE: 

162 <221> NAME/KEY: CDS 

163 <222> LOCATION: (72).. (752) 

164 <223> OTHER INFORMATION: GhMYB6 

166 <400> SEQUENCE: 3 

167 cggattttct ttccccgtgt ttggttgcac agaaagtgag agaaagtttt acttttgatt 60 

169 ttgaaactcc g atg aga aaa cct tgc tgc gat aaa caa ggc acc aac aag 110 

170 Met Arg Lys Pro Cys Cys Asp Lys Gin Gly Thr Asn Lys 

171 15 10 



173 


gga 


gcc 


tgg 


tec 


aag 


caa 


gaa 


gat 


caa 


aag 


etc 


att 


gat 


tat 


ata 


cgt 


158 


174 


Gly 


Ala 


Trp 


Ser 


Lys 


Gin 


Glu 


Asp 


Gin 


Lys 


Leu 


He 


Asp 


Tyr 


He 


Arg 




175 




15 










20 










25 












177 


att 


cat 


ggt 


gaa 


ggc 


tgt 


tgg 


cgt 


tec 


etc 


ecc 


aaa 


get 


gea 


ggt 


ttg 


206 


178 


He 


His 


Gly 


Glu 


Gly 


Cys 


Trp 


Arg 


Ser 


Leu 


Pro 


Lys 


Ala 


Ala 


Gly 


Leu 




179 


30 










35 










40 










45 




181 


cac 


cgt 


tgc 


ggt 


aaa 


agt 


tgc 


agg 


ctg 


aga 


tgg 


ata 


aat 


tac 


tta 


aga 


254 


182 


His 


Arg 


Cys 


Gly 


Lys 


Ser 


Cys 


Arg 


Leu 


Arg 


Trp 


He 


Asn 


Tyr 


Leu 


Arg 




183 










50 










55 










60 






185 


cca 


gat 


ate 


aaa 


cgt 


ggt 


aac 


ttt 


get 


caa 


gac 


gaa 


gag 


gac 


tta 


att 


302 


186 


Pro 


Asp 


He 


Lys 


Arg 


Gly 


Asn 


Phe 


Ala 


Gin 


Asp 


Glu 


Glu 


Asp 


Leu 


He 




187 








65 










70 










75 








189 


ate 


aaa 


etc 


cat 


get 


etc 


ctt 


ggt 


aac 


egg 


tgg 


tea 


ctg 


ata 


get 


ggt 


350 
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190 


lie 


Lys 


Leu 


His 


Ala 


Leu 


Leu 


Gly 


Asn 


Arg 


Trp 


Ser 


Leu 


He 


Ala 


Gly 




lyi 






o n 
oU 










00 










yu 










1 y J 




+- -t- =1 
tea 


cca 


gga 


aga 


aca 


gau 


a =1 4- 

aai. 


gaa 


gtg 


aag 


aac 


4- 4- 

uar 


tgg 


o 3 4- 

aar 


tee 


oyd 




Arg 


Leu 


Pro 


bxy 


Arg 


inr 


Asp 


Asn 


blU 


vai 


Lys 


Asn 


Tyr 


Trp 


Asn 


ber 








yo 










inn 
iUU 










1 n c 
lUO 












1 QT 

1 y / 




a ta 


aag 


aga 


aag 


era 


3 4- /-T 

arg 


aag 


4- /xr 

atg 




ate 


gar 


cct 


4- 

aat 


aac 


eat 


4 4 o 


lyo 


His 


He 


Lys 


Arg 


Lys 


Leu 


Met 


Lys 


Meu 


Gly 


T 1 

lie 


Asp 


Pro 


Asn 


Asn 


His 




ly y 


110 










lie; 

1 Id 










ion 
IZO 










IOC 

lz5 




o m 
zUi 


aag 


ttg 


aac 


caa 


tat 


cct 


ca u 


cat 


gu L 


ggt 


CCC 


ctt 


aac 


ecc 


acc 


ace 


A d. A 

4 94 


o r» o 

202 


Lys 


Leu 


Asn 


Gin 


Tyr 


Pro 


His 


His 


Val 


Gly 


Pro 


Leu 


Asn 


Pro 


Thr 


Thr 




203 










130 










135 










140 






O A C 


acc 


aac 


tec 


atg 


gat 


gtg 


gca 


tgt 


aag 


ctt 


aga 


gtg 


tgt 


tea 


aca 


gae 


542 


206 


Thr 


Asn 


Ser 


Met 


Asp 


Val 


Ala 


Cys 


Lys 


Leu 


Arg 


Val 


Cys 


Ser 


Thr 


Asp 




207 








145 










150 










155 








o n Q 


aat 


gat 


gat 


ggg 


ate 


tea 


gat 


get 


gca 


agt 


tat 


etc 


gaa 


gae 


gca 


aca 


con 
590 


OTA 

210 


Asn Asp Asp 


Gly 


He 


Ser 


Asp 


Ala 


TV 1 — 

Ala 


Ser 


Tyr 


Leu 


Glu 


Asp 


Ala 


Thr 




211 






160 










165 










170 










9 1 


ccg 


ccc 


act 


ggt 


aua 


tec 


aac 


ttg 


gae 


ctt 


gat 


etc 


aca 


att 


get 


4-4-4- 


D,30 


214 


Pro 


Pro 


Thr 


Gly 


He 


Ser 


Asn 


Leu 


Asp 


Leu 


Asp 


Leu 


Thr 


He 


Ala 


Phe 




2 Id 




175 










180 










185 












Z 1 / 


cct 


teg 


agt 


cct 


ate 


aag 


aat 


att 


att 


gaa 


gaa 


age 


eag 


cag 


aaa 


aca 


DO D 


91 P 
Z 1 0 


Pro 


Ser 


Ser 


Pro 


He 


Lys 


Asn 


He 


He 


Glu 


Glu 


Ser 


Gin 


Gin 


Lys 


Thr 




9 1 Q 

z 1 y 


190 










195 










200 










205 




9 91 
ZZ 1 


gca 


tct 


att 


gta 


aca 


aat 


gat 


gaa 


gaa 


gaa 


caa 


tat 


aca 


gte 


cct 


ace 


/ 04t 


9 9 9 
ZZZ 


Ala 


Ser 


He 


Val 


Thr 


Asn 


Asp 


Glu 


Glu 


Glu 


Gin 


Tyr 


Thr 


Val 


Pro 


Thr 




9 9 "5 
ZZO 










210 










215 










220 






o o c 
ZZO 


ctt 


ctt 


ctt 


ttc 


aga 


tga 


gaeaaaaaaa aaagcetcac aeatgtggag 




TOO 

782 


o o 

2zo 


Leu 


Leu 


Leu 


Phe 


Arg 


























O O T 

227 








225 




























o o r\ 

229 


attcgtgcaa * 


aagacctaaa ggcttacgaa ggcaacatgc 


aegeeattgt 


caaattcttt 


842 


zoi 


tggatgatgg « 


attgaaacca tatccttgtc cattagaaag 


gaggaagata 


agetaaaact 


ft ri o 
902 


O "0 "3 

z J J 


gtattattgt < 


gtataaattt ggtagaaaga aagatttcaa 


cttaagaatt 


aggatcaaat 


n /" o 

962 


O T C 


aactgaatga i 


atgaacgaat tgcagataag ttgttaggag 


gtttteaate 


aacttatctg 


T n o o 


237 


caattaattt < 


ggtggagctg atgtaggatg atgagttcat 


cgtaeatgaa 


ctgaaccttt 


1082 


o o n 

2oy 


gatatttcag < 


gctctaattg tctgtttgta tgcgtaaaga 


tattettcaa 


tgtgagatca 


T 1 J* O 

1142 


241 


gctaaaaaa 




























1151 


24 4 


<210> SEQ ID NO 


4 


























245 


<211> LENGTH: 226 


























24 6 


<212> TYPE: 


PRT 




























247 


<213> ORGANISM: 


Gossypium hirsutum 


















249 


<4 00> SEQUENCE: 


4 


























250 


Met 


Arg 


Lys 


Pro 




Cys 


Asp 


Lys 


Gin 


Gly Thr 


Asn 


Lys 


Gly Ala 


Trp 




251 


1 








5 










10 










15 






252 


Ser 


Lys 


Gin 


Glu 


Asp 


Gin 


Lys 


Leu 


He 


Asp 


Tyr 


He 


Arg 


He 


His 


Gly 




253 








20 










25 










30 








254 


Glu 


Gly Cys 


Trp Arg 


Ser 


Leu 


Pro 


Lys 


Ala 


Ala 


Gly Leu 


His 


Arg 


Cys 




255 






35 










40 










45 










256 


Gly Lys 


Ser 


Cys Arg 


Leu Arg 


Trp 


He 


Asn 


Tyr 


Leu Arg 


Pro Asp 


He 




257 




50 










55 










60 
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258 


Lys Arg 


Gly Asn 


Phe Ala 


Gin 


Asp 


Glu Glu Asp 


Leu 


He 


He 


Lys 


Leu 


259 


65 




70 










75 










80 


260 


His Ala 


Leu Leu 


Gly Asn 


Arg 


Trp 


Ser 


Leu 


He 


Ala 


Gly 


Arg 


Leu 


Pro 


261 






85 








90 










95 




262 


Gly Arg 


Thr Asp Asn Glu 


Val 


Lys 


Asn 


Tyr 


Trp 


Asn 


Ser 


His 


He 


Lys 


263 




100 








105 










110 






264 


Arg Lys 


Leu Met 


Lys Met 


Gly 


He 


Asp 


Pro 


Asn 


Asn 


His 


Lys 


Leu 


Asn 


265 




115 






120 










125 








266 


Gin Tyr 


Pro His 


His Val 


Gly 


Pro 


Leu 


Asn 


Pro 


Thr 


Thr 


Thr 


Asn 


Ser 


267 


130 






135 










140 










268 


Met Asp Val Ala 


Cys Lys 


Leu 


Arg 


Val 


Cys 


Ser 


Thr 


Asp 


Asn 


Asp 


Asp 


269 


145 




150 










155 










160 


270 


Gly He 


Ser Asp Ala Ala 


Ser 


Tyr 


Leu 


Glu Asp 


Ala 


Thr 


Pro 


Pro 


Thr 


271 






165 








170 










175 




272 


Gly He 


Ser Asn 


Leu Asp 


Leu 


Asp 


Leu 


Thr 


He 


Ala 


Phe 


Pro 


Ser 


Ser 


273 




180 








185 










190 






274 


Pro He 


Lys Asn 


He He 


Glu 


Glu 


Ser 


Gin 


Gin 


Lys 


Thr 


Ala 


Ser 


He 


275 




195 






200 










205 








276 


Val Thr 


Asn Asp 


Glu Glu 


Glu 


Gin 


Tyr 


Thr 


Val 


Pro 


Thr 


Leu 


Leu 


Leu 


277 


210 






215 










220 











278 Phe Arg 

279 225 

283 <210> SEQ ID NO: 5 

284 <211> LENGTH: 1081 

285 <212> TYPE: DNA 

286 <213> ORGANISM: Gossypium hirsutum 

288 <220> FEATURE: 

289 <221> NAME/KEY: CDS 

290 <222> LOCATION: (86).. (994) 

291 <223> OTHER INFORMATION: GhMYB7 

293 <400> SEQUENCE: 5 

294 ctccccgcgg tggcggccgc tctagaacta gtggatcccc cgggctgcag gaattcggca 60 



296 


cgaggaaaga agtgtgaaaa aaaaa 


atg 


gga 


agg 


agt 


ect 


tgt 


tgt 


tet 


aag 


112 


297 
















Met 


Gly 


Arg 


Ser 


Pro 


Cys 


Cys 


Ser 


Lys 




298 
















1 








5 












300 


gaa 


ggc 


ctt 


aac 


aga 


gga 


get 


tgg 


act 


get 


ett 


gaa 


gac 


aaa 


att 


ctt 


160 


301 


Glu 


Gly 


Leu 


Asn 


Arg 


Gly 


Ala 


Trp 


Thr 


Ala 


Leu 


Glu 


Asp 


Lys 


He 


Leu 




302 


10 










15 










20 










25 




304 


aaa 


gat 


tat 


ate 


aaa 


gta 


cae 


ggt 


gaa 


ggt 


egt 


tgg 


aga 


aat 


etc 


ccc 


208 


305 


Lys 


Asp 


Tyr 


He 


Lys 


Val 


His 


Gly 


Glu 


Gly 


Arg 


Trp 


Arg 


Asn 


Leu 


Pro 




306 










30 










35 










40 






308 


aaa 


aga 


get 


ggt 


ctt 


aag 


aga 


tgt 


ggg 


aaa 


agt 


tgt 


agg 


ctt 


egg 


tgg 


256 


309 


Lys 


Arg 


Ala 


Gly 


Leu 


Lys 


Arg 


Cys 


Gly 


Lys 


Ser 


Cys 


Arg 


Leu 


Arg 


Trp 




310 








45 










50 










55 








312 


ttg 


aat 


tat 


ttg 


aga 


cct 


gat 


att 


aaa 


aga 


ggt 


aac 


ata 


tea 


cct 


gac 


304 


313 


Leu 


Asn 


Tyr 


Leu 


Arg 


Pro 


Asp 


He 


Lys 


Arg 


Gly 


Asn 


He 


Ser 


Pro 


Asp 




314 






60 










65 










70 










316 


gag 


gaa 


gag 


ctt 


ate 


ate 


aaa 


etc 


cae 


aaa 


etc 


ttg 


gga 


aac 


aga 


tgg 


352 


317 


Glu 


Glu 


Glu 


Leu 


He 


He 


Lys 


Leu 


His 


Lys 


Leu 


Leu 


Gly 


Asn 


Arg 


Trp 





Please Note: 

Use of n and7oiO(aa~liave^Been qu€nGe4^istina. Rlease,reyi0w the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> to 
<223> fields of each sequence which presents at least ne n or Xaa. 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/453 , 387 



DATE: 06/06/2001 
TIME: 14:02:05 



Input Set : A:\Uco-956.app 

Output Set: C:\CRF3\06062001\I453387.raw 



L:104 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1 

L:447 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7. 

L:575 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:9 

L:593 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:10 

L:798 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:25 

L:827 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:26 
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